19 research outputs found

    MQALD: Evaluating the impact of modifiers in question answering over knowledge graphs.

    Get PDF
    Question Answering (QA) over Knowledge Graphs (KG) aims to develop a system that is capable of answering users’ questions using the information coming from one or multiple Knowledge Graphs, like DBpedia, Wikidata, and so on. Question Answering systems need to translate the user’s question, written using natural language, into a query formulated through a specific data query language that is compliant with the underlying KG. This translation process is already non-trivial when trying to answer simple questions that involve a single triple pattern. It becomes even more troublesome when trying to cope with questions that require modifiers in the final query, i.e., aggregate functions, query forms, and so on. The attention over this last aspect is growing but has never been thoroughly addressed by the existing literature. Starting from the latest advances in this field, we want to further step in this direction. This work aims to provide a publicly available dataset designed for evaluating the performance of a QA system in translating articulated questions into a specific data query language. This dataset has also been used to evaluate three QA systems available at the state of the art

    Bi-directional LSTM-CNNs-CRF for Italian Sequence Labeling and Multi-Task Learning

    Get PDF
    In this paper, we propose a Deep Learning architecture for several Italian Natural Language Processing tasks based on a state of the art model that exploits both word- and character-level representations through the combination of bidirectional LSTM, CNN and CRF. This architecture provided state of the art performance in several sequence labeling tasks for the English language. We exploit the same approach for the Italian language and extend it for performing a multi-task learning involving PoS-tagging and sentiment analysis. Results show that the system is able to achieve state of the art performance in all the tasks and in some cases overcomes the best systems previously developed for the Italian

    Semantically-Aware Retrieval of Oceanographic Phenomena Annotated on Satellite Images

    Get PDF
    Scientists in the marine domain process satellite images in order to extract information that can be used for monitoring, understanding, and forecasting of marine phenomena, such as turbidity, algal blooms and oil spills. The growing need for effective retrieval of related information has motivated the adoption of semantically aware strategies on satellite images with different spatiotemporal and spectral characteristics. A big issue of these approaches is the lack of coincidence between the information that can be extracted from the visual data and the interpretation that the same data have for a user in a given situation. In this work, we bridge this semantic gap by connecting the quantitative elements of the Earth Observation satellite images with the qualitative information, modelling this knowledge in a marine phenomena ontology and developing a question answering mechanism based on natural language that enables the retrieval of the most appropriate data for each user’s needs. The main objective of the presented methodology is to realize the content-based search of Earth Observation images related to the marine application domain on an application-specific basis that can answer queries such as “Find oil spills that occurred this year in the Adriatic Sea”

    Ghigliottin-AI @ EVALITA2020: Evaluating Artificial Players for the Language Game “La Ghigliottina”

    Get PDF
    Evaluating Artificial Players for the Language Game “La Ghigliottina” (Ghigliottin-AI) task is one of the tasks organized in the context of the 2020 EVALITA edition, a periodic evaluation campaign of Natural Language Processing (NLP) and speech tools for the Italian language. Ghigliottin-AI participants are asked to build an artificial player able to solve “La Ghigliottina”, namely the final game of an Italian TV show called “L’Eredità”. The game involves a single player who is given a set of five words unrelated to each other, but related with a sixth word that represents the solution to the game. Fourteen teams registered to Ghigliottin-AI. Nevertheless, only two teams submitted their run. In order to evaluate the submitted systems, we rely on an API base methodology, via a Remote Evaluation Server (RES). In this report we describe the Ghigliottin-AI task, the data, the evaluation and we discuss results

    From exercise intolerance to functional improvement: The second wind phenomenon in the identification of McArdle disease

    Get PDF
    McArdle disease is the most common of the glycogen storage diseases. Onset of symptoms is usually in childhood with muscle pain and restricted exercise capacity. Signs and symptoms are often ignored in children or put down to 'growing pains' and thus diagnosis is often delayed. Misdiagnosis is not uncommon because several other conditions such as muscular dystrophy and muscle channelopathies can manifest with similar symptoms. A simple exercise test performed in the clinic can however help to identify patients by revealing the second wind phenomenon which is pathognomonic of the condition. Here a patient is reported illustrating the value of using a simple 12 minute walk test.RSS is funded by Ciências sem Fronteiras/CAPES Foundation. The authors would like to thank the Association for Glycogen Storage Disease (UK), the EUROMAC Registry funded by the European Union, the Muscular Dystrophy Campaign, the NHS National Specialist Commissioning Group and the Myositis Support Group for funding

    Misdiagnosis is an important factor for diagnostic delay in McArdle disease

    Get PDF
    Diagnosis of McArdle disease is frequently delayed by many years following the first presentation of symptoms to a health professional. The aim of this study was to investigate the importance of misdiagnosis in delaying diagnosis of McArdle disease. The frequency of misdiagnosis, duration of diagnostic delay, categories of misdiagnoses and inappropriate medical interventions were assessed in 50 genetically confirmed patients. The results demonstrated a high frequency of misdiagnosis (90%, n = 45/50) most commonly during childhood years (67%; n = 30/45) compared with teenage years and adulthood (teenage: n = 7/45; adult n = 5/45; not known n = 3/45). The correct diagnosis of McArdle disease was rarely made before adulthood (median age of diagnosis 33 years). Thirty-one patients (62%) reported having received more than one misdiagnosis; the most common were “growing pains” (40%, n = 20) and “laziness/being unfit” (46%, n = 23). A psychiatric/psychological misdiagnosis was significantly more common in females than males (females 6/20; males 1/30; p < 0.01). Of the 45 patients who were misdiagnosed, 21 (47%) received incorrect management. This study shows that most patients with McArdle disease received an incorrect explanation of their symptoms providing evidence that misdiagnosis plays an important part in delaying implementation of appropriate medical advice and management to this group of patients.The authors would like to thank Mr Andrew Wakelin for his great and inspiring work. The authors would also like to thank AGSD-UK, CAPES Foundation, Muscular Dystrophy Campaign and the Euromac Registry for their support

    EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020

    Get PDF
    Welcome to EVALITA 2020! EVALITA is the evaluation campaign of Natural Language Processing and Speech Tools for Italian. EVALITA is an initiative of the Italian Association for Computational Linguistics (AILC, http://www.ai-lc.it) and it is endorsed by the Italian Association for Artificial Intelligence (AIxIA, http://www.aixia.it) and the Italian Association for Speech Sciences (AISV, http://www.aisv.it)

    The UNIBA System at the EVALITA 2018 Italian Emoji Prediction Task

    Get PDF
    This paper describes our participation in the ITAmoji task at EVALITA2018 (Ronzano et al., 2018). Our approach is based on three sets of features,i.e. micro-blog and keyword features, sentiment lexicon features and semantic features. We exploit these features to train and combine several classifiers using different libraries. The results show how the selected features are not appropriate for training a linear classifier to properly address the emoji prediction task

    Bi-directional LSTM-CNNs-CRF for Italian Sequence Labeling and Multi-Task Learning

    No full text
    In this paper, we propose a Deep Learning architecture for several Italian Natural language processing tasks based on a state of the art model that exploits both word- and character-level representations through the combination of bidirectional LSTM, CNN and CRF. This architecture provided state of the art performance in several sequence labeling tasks for the English language. We exploit the same approach for the Italian language and extend it for performing a multi-task learning involving PoS-tagging and sentiment analysis. Results show that the system is able to achieve state of the art performance in all the tasks and in some cases overcome the best systems previously developed for the Italian
    corecore